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METHOD FOR OBTAINING D.N. A. R.K.A. PEPTIDES 
POLYPEPTIDES OH PROTEINS OF A D.I.'.A. RECOMBINANT 
TECHNIQUE 



The present invention has as its object a process for 
obtaining DNA, RNA, peptides, polypeptides or proteins, 
through use of transformed host cells containing genes 
capable of expressing these RNAs , peptides, polypeptides, 
or proteins; that is to say, by utilisation of recombinant 
DNA technique. 

The invention aims in particular at the production of 
stochastic genes or fragments of stochastic genes in a 
fashion to permit obtaining simultaneously, after 
transcription and translation of these genes, a very large 
number (on the order of at least lO'OOO) of completely new 
proteins, in the presence of host cells (bacterial or 
eucaryotic) containing these genes respectively capable of 
expressing these proteins, and to carry out therafter a 
selection or screen among the said clones, in order to 
de-termine which of them produce proteins with desired 
properties, for exemple structural, enzymatic, catalytic, 
antigenic, pharmacologic, or properties of liganding, and 
more generally, chemical, biochemical, biological, 
properties . 

The invention also has as its aim procedures to obtain, 
sequences of DNA or RNA with utilizable properties notably 
chemical, biochemical, or biological properties. 

It is clear, therefore, that the invention is open to a 
very large number of applications in very many areas of 
science, industry and medicine.' 

The process for production of peptides or polypeptides 
according to the invention is characterized in that one 
produces simultaneously, in the same medium, genes which 



are dt least partially composed of synthetic stochastic 
polynucleotides, that one introduces the genes thus 
obtained into host cells, that one cultivates 
simultaneously the independent clones of the transformed 
host cells containing these genes in such a manner so as to 
clone the stochastic genes and to obtain the production of 
the proteins expressed by each of these stochastic genes, 
that one carries out selection and/or screening of the 
clones of transformed host cells in a manner to identify 
those clones producing peptides or polypeptides having at 
least one desired activity, that one thereafter isolates 
the clones thus identified and that one cultivates them to 
produce at least one peptide or polypeptide having the said 
property . 

In a first mode of carrying out this process, stochastic 
genes are produced by stochastic copolymerization of the 
four kinds of deoxyphosphonucleotides , A,C,G and T from the 
two ends of an initially linearized expression vector , 
followed by formation of cohesive ends in such a fashion as 
to form a stochastic first strand of DNA constituted by a 
molecule of expression vector possessing two stochastic 
sequences whose 3 1 ends are complementary, followed by the 
synthesis of the second strand of the stochastic DNA. 

In a second mode of earring out this process, stochastic 
genes are produced by copolymerization of oligonucleotides 
without cohesive ends, in a manner to form fragments of 
stochastic DNA, followed by ligation of these fragments to 
a previously linearized expression vector. 

The expression vector can be a plasmid, notably a 
bacterial plasmid. Excellent results have been obtained 
using the plasmid pUC8 (see Vieira & Joachim, Gene, 19, (19a2), 
259-268) as the expression vector. 

The expression vector can also be viral DNA or a hybrid of 
plasmid and viral DNA. 



The host cells can be prortaryotic cells buch as HE 101 
(American Type Culture Collection No. 23724) and C 600 (American 
Type Culture Collection No. ??694 ) , or eukaryotic cells. 
When utilizing the procedure according to the second mode 
mentioned above, it is possible to utilize 
oligonucleotides which form a group of palindromic 
octamers . 

Particularly good results are obtained by utilizing the 
following group of palindromic octamers: 

5 ' GGAATTCC 3 ' 

5 ' GGTCGACC 3 ' 

5 ' CAAGCTTG 3 ' 

5 ' CCATATGG 3 ' 

5" CATCGATG 3' 

It is also possible to use oligonucleotides which form a 
group of palindromic heptamers. 

Very good results are obtained utilizing the following 
group of palindromic heptamers: 

5' XTCGCGA 3' 

5' XCTGCAG 3' 

5' RGGTACC 3' 
where X = A,G,C or T, and R = A or T 



According to a method to utilize these procedures which is 
particularly advantageous, one isolates and purifies the 
transforming DNA of the plasmids from a culture of 
independent clones of the transformed host cells obtained 
by following the procedures above, then the purified DNA is 
cut by at least one restriction enzyme corresponding to 
specific enzymatic cutting site present in the palindromic 
octamers or heptamers but abserft from the expression vector 
which was utilized; this cutting is followed by 
inactivation of the restriction enzyme, then one 
simultaneously treats the ensemble of linearized stochastic 
DNA fragments thus obtained with T4 DNA ligase, in such a 
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manner to create a new ensemble of DNA contaimg new 
stochastic sequences, this new ensemble can therefore 
c ,„„ a .; r a number of stochastic genes larger than the number 
of' genes in the initial ensemble. One then utilizes this 
new ensemble of transforming DNA to transform the host 
cells and clone these genes, and finally utilizes screening 
and/or selection and isolates the new clones of transformed 
best cells and finally these are cultivated to produce at 
least one peptide or polypeptide, for example, a new 
protein . 

r- ^v,o rritprion for selection of tne 
The property serving as the criterion 

<= ^-1^ can be the capacity of the peptides or 

clones of host ctrl^s can l-<~ j 

. , _ i-n ratalvse a qiven 

chemical reaction . 

For instance, for the production of several peptides and/or 
polypeptides, the said property can be the capacity to 
catalyse a sequence of reactions leading from an initial 
group of chemical compounds to at least one target 
compound . 

With the aim of producing an ensemble constituted by a 
plurality of peptides and polypeptides which are 
reflexively autocatalytic , the said property can be the 
capacity to catalyse the synthesis of the same ensemble 
from amino acids and/or oligopeptides in an appropriate 
milieu. 

The said property can also be the capacity to modifiy 
selectively the biological or chemical properties of a 
given compound, for example the capacity to selectively 
modify the catalytic activity of a polypeptide. 

The said property can also be the capacity to stimulate, 
inhibit, or modify at least one biological function of at 
least one biologically active compound, chosen, for 
example, among the hormones, neurotransmitters, adhesion 



factor,, growth factors and specific relators of DNA 
replication and/or transcription and/or translation of 

RNA . 

The said property can equally be the capacity of the 
peptide or polypeptide to bind to a given ligand. 

The xnvent.on also has as its ob-ect the use of the peptide 
or polypeptide obtained by the process specified above, .or 
the detection and/or the titration of a ligand. 

According to a particularly advantageous mode of carrying 
out the invention, the criterion for selection of the 
ric-e* of transformed host cells is the capacity of the.e 
l^e* nr polypeptides to simulate or nocify the effects 
of* a biologically active molecule, for example, a protein 
and screening and/or selection for clones of transformed 
host cells producing at least one peptide or polypeptide 

i ^v-rt^^r^nQ 



having this property, is carried out by prepar 
antibodies against the active molecule, then uti izing 
these antibodies after their purification, to identify the 
clones containing this peptide or polypeptide, then by 
cultivating the clones thus identified, separating and 
purifying the peptide or polypeptide produced by these 
clones, and finally by submitting the peptide or 
polypeptide to an in vitro assay to verify that it has the 
capacity to simulate or modify the effects of the said 
molecule. 

According to another mode of carrying out the process 
according to the invention, the property serving as the 
criterion of selection is that of having at least one 
epitope similar to one of the epitopes of a given antigen. 

The invention carries over to obtaining polypeptides by the 
process specified above and utilizable as 
chemotherapeutically active substances. 



In particular, in the case where the said antigen is BCF. 

L invention pewits obtaining poiypeptides .sable for 

± ^ fvc^-.pnl of epitheliomas, 
cher.otherapeut ic treau.ieni 01 

According to a variant of the procedure , one identifies 
lis the clone, of transformed host cells producing 
peptides or polypeptides having the property desired, b> 
affinitv chromatography against antibodies correspon^.g ~ 
a protein expressed by the natural part of the DNA hybrid. 

For example, in the case where the natura! part of th. 
hybrid DNA contains a gene expressing £ -ga lacto ^ c 

Y clv iden-^v and isolates the saic clones o. 
ran advantageously iden^\ <*..u ^ 
. ^ k„ affinitv chromatograpny again^. 
transformed host celxs ^ - - 

anti- 6 -gaiactosidase antibodies. 

M ter expression and purification of hybrid peptioes o 
P "ypeptides, one can separate and isolate their novel 
parts . 

, - D -n e c tn a use of the process 
The invention aiso «FF- e - — - 

specified above for the preparation of a vaccine the 
■ ■ characterized by the fact that antibodies 
application is characterise, . ev „™ lc 

against the pathogenic agent are isoiateo ~. e t 
antibodies forced after injection of the P^-ic a e 
in th e body of an animal capable of for.mg antibodies 
against this agent, and these antibodies - " ° 
identify the clones producing at least one protein h ving 

t least one epitope similar to one of the epitopes of 
pathogenic agent, the transform host ceU correspon g 
to these clones are cultured to produce these protein 
thi s protein is isolated and purified fro. the clones of 
Til., then this protein is used for the production of a 
vaccine against the pathogenic agent. 

For example in order to prepare an anti-HW vaccine one 
ear, extract and purify at least one capside protein of the 
H8 V virus, in 3 ect this protein into an ani.al capable o 
£ormi „g antibodies against this protein having at least 



■ . lar to one of the epitopes of the HVB vuus, 
epitope similar " one transformed host cells 

then cultivate the clones ^ 

^ thpse clones in a manner to v 
correspo.,-,,, ~ - - n from culture of 

pro tein -a » > ^ the prQtein for the 

these clones of cells 

anti-HBV vaccine, 
production af an anti 

process according to contains 
in bacteria such as Esche _ aa lactosidase , nor 

neither the natural gene expressing i ^ 

t v.. at is to say, Z 
the EBG ,.„. «...t „ o£ x ,.i and 

ttansfornea cells «re posi tivc- fcr 

«• IPTO in the »*. . - ^ th e 

of host cexj-=» i-o- 

one peptide cr polypeptice. 



^^prion for selection o. t..e 
The property serving as tn« capa city of the 

transformed host cells can also be 
^vp.otides or proteins produced by the 
clones to bind to a given compound. 

d can be in particular chosen advantageousely 
This compound can be f orote ins, notably among 

among peptides, polypeptides ana DNA. 
proteins regulating the transcription activity 

h , the said compound can also be chosen 
On the other hand , the saiu 

among DNA and RNA sequences. 

T.n as its object those proteins which 
The invention ^* ""^^J thB property serving as 
are obtained in the case whe transforme d host 

criterion of se^c- » of £ c - ^ bind 

cells consist in the capacity lon activity 

to regulatory proteins controlling transcrip 
of the DNA , or else to DNA and RNA sequences. 



The xn - r « 7 i ;;r;i r :::„;i; 1: 

protei „ which is ^^ZXJ^ =o»«omn, 

above mentioned, a, a — /- neighboring DNA 

replication or transcr ipt a on <u 

sequence . 

„ ., fhe aim o£ the invention also includes 

On the other hand, the aim or ^ 

utilization of proteins obtained in the „ 

mentioned to modify the properties of t ,„ 
replioation of a sequence of DM . - a cell cent... . 
seLnce of DNA , and expressing this prot^i... 

v. • ^ we ii a process of 

v, ac as its object as *e-- a f 
The mventia; — qimu ltaneous production 

nroduction of ON A , characterized ny — _ ^ 
In the same medium of genes at least partly — 

, KoHr nnlvnucleotides, in that rue y 
stochastic synthetic poly ^ produce an 

thus obtained are xntrou^ed ^ 

ensemble of transferee host .e - , ^ 

^ rhis ensemble is Carrie- 
and/or selection on this ^ 

k^c^ rplls contaminy 
indentify those host cells des ired 

nNA havmq at least one 

the T » — o £ 

host cells thus identified is isolated. 

stochastic synthetic P^^^' ^ t / P rod„ 1 an 
thu s obtained are introduced into host cells to p 
ensemble of transformed host ceils, in that » 
so p roduced are cultivated ^ « ^ ^ . 

snd, °: ; el i::::u; th r h : S t cu. — 

in that the RBA is isolated from the hos. 

identified. hind a given 

The said property can be the capacit y to bind 
compound , which might be for exampie a ,- r 



* - n nr also the capacity to catalyse a 



rn; 



^^r, tn the invention will be 
No w the process according to the ^ ^ 

described in more details, as ^ ernbodim ents . 

applications, with reference to non limitative 

F , rst w „ .hall describe particularly useful procedures to 

LV-c syndesis of stochastic genes, ana the 

C3rry C nose genes in bacteria to produce clones 

introduction of those gc.it^ 

of transformed bacteria. 

D pirecj L _svo 1 th J ^ 

a) Linearization of the vector 

ir 13 _„i P c: the 
c approximately 10 niui^u.e. ~~ - 

rnrfi flXDre c:cion vector are linearized dv 

s ; C with 100 units of the Pstl restriction enzyne 

■"I volume of 300 1 of the appropriate standard bufter. 

in a volume of 3 ^ 

The linearized - ct « " .„ volume of 30 1 

then precipitatea in etna:..., i- buffe r. 

^ = n r* aaarose qel in standard TEB DUitej.. 

and loaded onto a 0,8% agarose y 

• .-nn in a field of 3V/cm for three hours, the 

rr.:;:; :::,:; \: ...c^-. - 

ethanol, and taken up in 30 1 of water. 

b) stochastic synthesis using the en^e Terminal 
Transferase (TdT) 

30 ug of the linearized vector are reacted for 2 hours at 

„ f TdT in 300 1 of the appropriate 
37'C with 30 units of TdT xn 30 ^ ^ 

buffer, in the presence of 1«M ' ^ .„ 

and 1 mM dATP . The lower concentration of oTTP is 

order to reduce the frequency o£ stop alth0 „ gn 
correspond^ messenger «. * -Uar res i ™ 

somewhat less favorable, can be obtained y 
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• fc - dATP than for the other 

lower concentration fc. <L ^ ^ 

-oxynucleotide r el I ^ ty of the P.tl sites U 

polymerization on the 3 e ^ ^.^ the 
followed by analysis on a gel ot 
course of the reaction. 

stairs or passes a mean value of 300 
When the reaction attains P ^ the 

npr 3* extremity, it ii> * L 
mCleOUte : aa ;! P are sepsr ,ted fro. the poiymer by 
free nucleotides are sep age over a colunn 

differential precipitation or y F 60 (HegiBt ered 

containing a molecular sieve such as ln ethanol , 

- i \ a fi-pr concentration by precipe 
Trade MarK). co polyme ri Z ation wxth 

the polymeres are subjected t ^ q£ 

TdT , first in the presence AT, the ^ 
dTTP . These last ™ intervals (30 seconds 

on a gel and are carried out .or sn0 followed 
to 3 ninutes) in order to add sequentially 30 
by 10-30 T to the V ends of the pollers. 

c, Synthesis of the second strand of stochastic O.A 

_ t the end of the 

Each molecule of vector possesses, 

pr ecedin 9 operation, two stochastrc se.u .ho 

.„ complementary. The mixture of ^~ r ' of tbe 

in cuhated in conditions . ; -^' ^ is . K1 , pB 

complementary extrernit.es U50m* «.« 

7... I- B»A at 6»- for 10 mrnutes folio y _ 

o->«r at a rate of J to r 
the temperature to 22 C at a of 

The hybridized pollers are then reacted wxth 60 ^ 
the lar g e fragment ' °\r^^ ^ at 

presence of the four nucleotide "^^^^ o£ 

. e Th ;, s tep accomplishes tne synw 
4 . C for two hours Thrs s P ^ ^ 

the second strand from tne fllrect synthesis 

The molecules which result from thrs di 
starting from linearized vector are thereafter 
transform competent cells. 



d) 



Transformation of competent clones 



- pp. 101 of C600 at a concentration 

10 0 to 200 ml of content HB101 of C6 stochastlc DN a 

10 n ,rr,i are incubated with the stou 

of 10 10 cells/ml, are i Dre£ence of 6 mM CaCl 2 , 6 mM 

^e—t.ion (from above) in the presence 2 

^HCl pH8 , 6 mM MgCl, for 30 ^. ; ^ 

temperature shoe* of 3 mi - teE i ; n t ^ 4 C o ; to ml of *ZY 

fixture, followed by the addition of 400 ^ 

culture medium, without ^xb"txcs . ^ 

culture is incubated at 37* for 60 ^ ^~ ^ of 

10 litres by addition of NZ Y medium containing £J 

iU . . c * 0 „t* of incubation at 37 C, 

ampicillin. Alter ^ »o^- cf 

amplified culture is centrifuged and t ^ ^ 
n c iQ ivophilysea and storea at 

» ™ ltUre C ° n ;r „ 1 ,toc h ..ti= ,e,e inserted into *. 

each containing a unique 

expression vector. 

rleotide^^ithou^cohesive^ 



c 1 i c o n u c 



, ur . , s b ~ed on the fact that polymerization of 
Thls proceed .s b- oligonucleotides permits 

judiciously chosen palindromic o 9 n 

r .^^Hc aenes which have no st-<j F 
construction of stochastic gen ^ ^ 

«f thP six possible reading n«»— ■■ 
in any of the six p representat ion of triplets 

same time assuring a balance p ^ g 

sp ecifying all amino - 'J" pr t£ins which result, 
repetition of sequence mot^s m P ^ 

fulfil these criteria: 

a) Choice of a group of octamers 

The group of oligonucleotides following: 

5' GGAATTCC 3' 
5' GGTCGACC 3' 
5 ' CAAGCTTG 3 ' 
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5 ' CCATATGG 3 ' 
5' CATCGATG 3' 



. ^ (fh nc self complementary 

i8 composed of 5 panndro.es (thus 1 * c 

sciences, where it is easy to ven , tha the. ^ 

polymerization does not generate any stop 

specifies all the amino acids. 

r'her qroup of palindromic 
Obviously. one ca» »t,lu. o.her , P _ ^ 

octamers which do not generate an, ~° 
speclIy .n the -ino acid, tound in P oi yp.pt He. . Cl« - 
it is also possible to utilize non palindromic g^s c 
tenets or other oligomers, under the condition the 

^ lw »t. £ -, in , »»>. .«.»«.* •» «. 

used . 

bl Rs se*bly of a stochastic gene fro, a grou F o£ octamers. 

t*<= ol iaonucleotices 
A mixture containing 5 ^ each o. t..- . 

.Kncnhnrvlated at the 5 
indicated above (previous - 
position by a standard procedure) is reacted in a 
position Dy i mM ATP 10% polyethyleneglycol, and 

volume containing 1 mM A1F, F . 3t 

ivr for six hours. ini*> 

state and without cohesive ends, ihe resu.-ng p 
iolated by passage ever a molecular sieve < logel 

■4-v, 70 to 100 oligomers. Atter 
recovering the.. ■ agai „ sub „ itte a to 

concentration, this fraction , . „ ,. nd . r the 

catalysis or polymerization by T4 DBA ligase under 
end it ns described above. Thereafter, as described above 
those polymers which have assembled at least 100 oilers 
are isolated. 

c) Preparation of the host plasmid 

The POCS egression vector is linearized by S»l .«»~ 

w *f~r- sc: described above. The vector 
the appropriate buffer, as desc^u- 



linearized by SMa 1 does not have cohesive ends. Thus the 
linearized vector is treated by calf intestine alkahne 
phosphatase (CIP) at a level of one unit per microgram of 
vector in the appropriate buffer, at 37'C for 30 nunut^ . 
The CIP enzyme is thereafter inactivated by two successive 
extractions with phenol-chloroform. The linearized and 
dephosphorylated vector is precipitated in ethanol. then 
redissolved in water at 1 nig /ml. 

r ^4- ^r-K - c+- r oprp^ to the vector 
d) Ligation of stocncis^c c,t;n__ l . 

Ecuimolar quantities of vector and polymers are mixed ana 
incubated in the presence of 1000 units of T4 DNA ligase, 1 
m A TP, 10% polyethylene glycol, in the appropriate bu£fe>, 
for i2 hours at 13 C C. This step ligates tho stochastic 
polymers in the expression vector and forms double stranaea 
circular molecules which are, therefore, capable of 
transforming . 

Transformation of c ompetent clones. 

Transformation of competent clones is carried out in the 
manner previously described. 

HI) a„ B «mblv of ste notic: genes starting from i a_ a roup_^f 
heptamers. 

This procedure differs from that just discussed in that it 
utilizes palindromic heptamers which have variable cohesive 
ends, in place of stochastic sequences containing a smaller 
number of identical motifs. 

a) Choice of a group of heptamers 

It is possible, as an example, to use the following three 
palindromic heptamers: 

5' XTCGCGA 3' 
5 ' XCTGCAG 3 ' 
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5 1 RGGTACC 3' 



a I?- a nr T t and where 
v - A G C, or T ana R-A or 1 

^;«rt»tio» «»«ot ,».»« any "stop" codons .no rorms 
triplets specifying all the amino acids. 

Clearly it is possible to use other groups o £ heptamers 

fulfilling these same conditions. 

. • _ ,_ „<r prniin of heDtamers 

b) Polymerization 01 a c,i~i-t- 

rarrxed out exactly in the fashion 
This polymerization is carriea 

described above for octamers. 

c) Elimination of cohesive extremities 

The polymers thus obtained have one unpaged base on their 

Thus it is necessary to aad tne 
<- W o 5' extremities, thus, ^ 

" ^ , , Prr .pcnnndipo 3' extremities. 

^ r . r Tp.p^an' base to the cor.es^^^-. 

j ^,,+- c follows: 10 mi^rog^cii.^ 
This is carried out as toixCv,s. ? vlrrr .. 

J - * „- r ~ . re ^e-+ed with 10 units of tne Klenow 
stranded poiymer*. ^e^.eu lon .HrtP- 

„ Q r,f the four deoxynucleotide 
enzyme, in the presence of the rour 

phosphates in a volume .« , 00 ^.t.-C. 

minutes. The enzyme is inaetivatec by pn...~ - 

traction, and the polymers are cleansed of the residu 
free nucleotides by differential prestation. Tne 
pciymers are then ligated to the host plasmid (previous y 
linearized and dephosphorylated) by following the 
procedures described above. 

It is to be noted that the two last procedures which were 
described utilize palindromic octamers or heptamers which 
constitute specific sites of certain restriction enzyme^ 
These sites are absent, for the most part, from the P UC 
expression vector. Thus, it is possible to augment 
considerably the complexity of an initial preparation Oi 
stochastic genes by proceeding in the following way, 
p la smid DNA derived from the cuiture of 10 inoepe 
Lnsformants obtained by one of the two last prccecure 



oi 
the 



. . „ above is isolated. After this »« is purified. 

deS " la Uany digested by Cl.l restriction enzyne 
lt 1S part aly g i rastrjctlon enzyme 

„ Tin After inactivation ot rhc 

(procedure III). A t ^ ^ ^ 

partially digestec DN« is tre ^ ^ 

has the effect of creating a very 9 

. nre£ while conserving the fundamental properties 
sequences, wnne <, u . ew p _ semb i e of stochastic 

the initial sequences. This new e,.se..*-e 

par th e" be used to transform competent cells, 
sequences can the- be u M nrp£ , bv procedure II 

in addition, the stochas^c g-n_ _, essira 

^ intart from the pUCB ex^e^iu 
and III can be excised intact fro \ tQ the 

-::»/::::■':;• :"^.r;r » 

sequence s . 

Beco^ination within the stochastic genes » the 

procedures just described, which resuit ,ro» t 
: r - r al -.oloov due to the recurrent „olecu!ar .otits.^s 
- important additional method to achieve ±i ^ 1 ^ 
an imP ° r ; ant _ ^ _ dina seque nces. This results xn an 
mutagenesis u±. , ^ wh irh can be 

examined . 

• :1 all the procedures to generate novel synthetic 

Finally, for all the pr techniques 

g enes, it is P°»»** " ^ ' ^ such as a change of 
to modify genes in_v±V£ or i» 

reading frame, inversion of sequences with respect to 
promoter, point stations, or utilization of host 
expressing one or several suppressor tRNAs . 

■ • ^„ if ic clear that it 
In considering the ^ve description ^. ^ 

is possible to construct, in vitro a different 
number (for example greater than » bllll °»> or o£ 

ge nes, by enzymatic polymerization of nuc Lot x- « ^ 
oligonucleotides. This polymerization is earn 
stochastic manner, as determined by the J 

* ,inn, of the nucleotides or oligonucieo 
concentrations or uie 

present in the reaction mixture. 
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polymers to the expression vector. 

*- ra ncfnrmation or 

In the two cases, the next step x j »~ ^ 

^ mmoetent bacterial cexxs to- 
^nsfect.on of tiwte , cloning the stochastic 

cuiture,. Th„ « uhere t hey .» 



genes in 
propagated and expressed. 



• „.,., ,,Mrl- were describee 

1v ^ Edition to the P rocecur«= = 

j. early, -ai . other methods wr.ich ,r c 

abOVe ' " 15 ntUis of stochastic sequences. In 

particular, i- - . ^-ar^ed oliaomers of uisn 

biocherr.icai i^a,- f ^ - treat these 

TO » obtained by che.ica! synthest t dures to 

vva ^> established 

segments of 

generate double stranded Dt,A (cDNAl 



genes - 



• „ or sele_ S ti 2JU >f_cio E ^^ 

Screenin g nr seienxy 

of the procedure according to the 
The further step of P tra nsformed or 

, Anei cts in examining the 
invention consists qrree ning, in order to 

— f — c r tnrr»r :::: - 

product (KHA) or translate product <P ^ 
Lired property.These propert.es car, be, 
enzymatic, functional, or structural. 

of the process, according 
One of the most important aspects of P eous 

1Q that it permits tne sn 
to the invention is t able pro duct (RNA or 

screening or selectxon of an ^ Jn 

protein) and the gene v,„ich pr 



biochemical properties. 

^ 0 as non-limitating examples, 
we shall now describe, as n clones 
preferred procedures for screemn o ^ e _ of 

of tran s f o,ea c.l, .« ^ -.^ ^ medicai 
interest from the point ox Mew 
applications . 

obtaining, polyclonal or ... „„,» iri or 

another type of — or has been rendered, 

• <. „ B 4. where that molecule is, or nab 

interest where t these ■ antibodies as 

immunogenic, and there.fte ^ ^ 

probes t o identify among the ' ein react 

transformed by stochastic genes, .hose who p ^ 

- r .i^ Q Aie*. This reaction is a resu. 
with these — °- 6 J between the polypeptide 

structural homology which exists 

v ■ ori hv the stochastic gene and tne 
synthesized by the num bers of 

i T4- ie: nossible in tnis wa> -~ 
molecule. It is possiu .<. nno . n r antigenic 

- 1 pr °- ins rtiST-^rrL' jei 

determinants on the initial mo or block t he 

are Uable to simulate, stimu^ >' ^ this 

effect of the initial molecule. It wi 1 
means of selection or screening may ^el f 

i .ir and biochemical applications . Beiow 
pharmacologic and mode of 

describe, as a non limiting example, thi- 
operation in a concrete case : 

factor) is a small protein present 
EGF , (epidermal growth , faco r) ^ ^ Qf 

in the blood, whose role is to ^ 

epithelial cells. This effect 

interaction of EGF with a specific ™ * 

membrane of epithelial cells. ^^l^ ^pL to 

EGF are prepared by injecting animals with EGF coup 
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KLH 



KL H (Keyhole limpet hemocyanin) to augment the 
rltv of tie EGF . The anti-EGF antibodies of the 
immunogenic lty ol tie nas^aqe 
a-ir,al-. *re purified, for example, by passage 
immunizes anima^- --c i 

*n affinity column, where the ligand is EGF or 

bl nd those stochastic peptides o, P r -«- 

>o» those of the initial antigc. - 
resemble those 01 shown by 

containing such peptides or proteins a.c 

toradiograpby after mention c £ the soli, soppc, - 
radioactive protein A, or after mcuba^on 
radioactive ar.l i -antibody antibody. 

The se steps identify those crones, -h of -J™^ 
one protein (and its gene, reacts with the sc. 

f ,.,M,ip srreen among a *t^,- - 

antibody. It is feas.bxe t~ s. 

^ mlnrie- of bacterial cells or vu- f - 
numr>er oi colonic ^ fo .,: ble to 

example, on the order of 1'Oou 00u, «»-" " of , 

detect extremely small quantities, on 

nanogram, of protein product. Thereafter, t,e identif 

£ -^k^-i^i ppIIs to determine it tney 
vitro in cultures of epithelial cells t 

inhibit, simulate, or modulate the effects of E.F o.. t. 

^ nroteins so obtained, some may be 
cultures. Among the proteins 

. . for the chemotherapeutic treatment of 

can be improved by mutation of the DNA g 
Toteins, .ays anaiogous to -" Scribe above., 

variant of this procedure consists in purify g 
stochastic peptides, polypeptides or proteins, which can be 
used as vaccines or more generally to confer » -» 
against a pathogenic agent or to exercise other e ec o 
t „e immunological system, for example to create 
tol erance or diminish hypersensivity with respect to 
given antigen, in particular due to binding of 
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• ^ uith the antibodies 
peptides, polypeptides or p. ^ 

that it ib 

vitro as well as ir^ivo- 

. lv in the ensemble of novel proteins which 
More precisely, in an tigen X, each 

react Wlt h the -t^e a g t ^ ^ ^ ^ ^ 

haS " 16351 ^ Z o epitopes in common with X. This 
ensemble has an ensemble o. epit P en£eI ^ le as a 

,. li7 .^ 01 of the ensemble o. su. 
permits utilization c _ example, 

fn confer immunity against X. It ^, 
V3CCine t0 f P or several of the capsid proteins or the 
easy to purify o e or ^ ^ ^ into 

hepatitis B virus. These pr ant ibodics 



an animal, for example, - — - • covered by 

, Sr „ the initial antigen Con be .ec 

correspond^ ~ - - - antlb odies may be usee, 

affl nity -lu- PU-f cation- ^ ^ 

as .escribed above , to loen * at lea5t one of 

or . e protein having an epitope purification, 

. ^ the initial antigen. After pur,., 

the epitopes - the i ^ Qr in 

these proteins conferring prot ection against 

combination) ^ h ^ ion of the vaccine does not 

hepatitis B. The fin P pat hogenic agent, 

require further access tu w..«= 

* ■„„ the description of the procedures above, 
No te that, during the de. P ^ h£Ve 

a number of «n. ^ ur£ may require the 

been described All ^ the P ^ & transf orI * ed 

purification of a particular P carrie d out by 

^one. These prote.n -^^/^p "Liar, 

.pushed - - iM an by 

technrques of gel chroma P ^ the proteinB 

affinity chromatography . e(J in 

generate, »y the stochasti . ^ « » » , seque „ ce 

of the [i gal.ct dase antib odies, and 

chromatography aga in st anti (J g ^ u 

all o-s the subsequent cleavage of K * ^ ^ 

to say, allowing separation of the 



^ we describe the 

polypeptides an, selection baseo o.. - - 

Lend method of s— or ^ polypeptiae5 

det ection of the capacity o 

„ catalyse a specific reaction. 

..^e screening or 

in the particular "be function 
.election m tne p ce normally a r — 

cataiysmg the cieavag. ; £ - r d „ e , ,, gall „U he 

fulfilled by enzyme ? 9 

described. 

fts abovc aes-ibea, the ""l^^ZZV"™ 

in generating a very ptet ,t». To be concrete. 

— — sin9 a „: is :;: S e ^ it:*; 

for example, one may cn~ _ n the rss _ 

Va -4-^ S equem.es 
cloning of stocna,..- J ofctained ar e t., = .. 

restriction .if. «* P>»^ £ron „ h ose genome the 
in troa»cea m a ^ ^J^^, z , and a secon* gene 
natural 9 ene for 6 <Jalac ^ t ^ l6s 

EBG , unrelated to the f«.t * by tnown gcetic 

fo-ga! function, have bott, been ^ ^ ^ by 

methods. Such host cells I . lysis , and as a 

themselves to catalyse lacto ^ ^ gro „ tb . 

conseguence to use lactose » £or sc „ enin g 

Th is permits utilization of such 
• for fe -gal function, 
or selection for f> 1" 

analyse transformed E coli 
A convenient bi ° l0gi °f "^J^ , e nes expressing a ^9=1 
clones for those which have nov ! _ t „ ns£or „ed as 

functio „ consists in the x „ gal ln th e medium . 

describea in petri dishes - ^ ^ . ^ . ga l 

X„ this case . ^^.^W:; colonies. By using such a 
function are visualized as b ^ ^ ueaVs 

bi ological assay it x^J o£ characteristic 

catalytic activity. The P 



, rannes from 10 and 10,000 product molecules per 
enzymes ranges uuu. 

second . 

Suppose that a protein .y»th..l^ by a stochastic ,o«. 

, ' W..X specific activity, on the order of ». 
per 10 0 seconds, it renins possible to detect such 

atactic activity. Xn a petri dish containing X-,. >n 
medium, and in the presence of the non « .b o. 
inducer 1PTG , rsopropy l-D-thiogalactosrde, vrsualrzatron o ; 
. blue region retires cleavage of about 10 to 0 

■ ■ A bacterial 

surface 



X-gal par square millimeter, 



molecules of 

= fo^v pn7v^ie and occupying a 
colony expressing a weaK enzpe a 

. K , in 7 to 10 8 cells. If each cell has 
area of lmm2 has about 10 to 10 ceil 

cnly one copy of the weaK enz^ , ta... 

Hpavaoe of between lO'OOO and 100 of x-qal to nc 
would require between 2.7 and 270 hours. 
Since under selective conditions it is possible to an,pxx ; y 
the number of copies of each plasmid per ceil .o . t. ^ 
cop.es per cell, or even to 100 or 1000, and because up o 
„f the protein of the cell can be specify x>y t.c ,e„ 
^erthe duration needed to detect a colony blue 
case of 100 enzyme molecules of weak activity per cell 

r a 9-? +- r . •> 1 hours, 
on the oraer oi u.z* ^ 

As a consequence of these facts, screening a very large 
number of independant bacterial colonies, each express., a 
different novel gene, and using the capacity to express a 
P>-gal function as the selection criterion, » fully 
Lsible. It is possible to carry out screening of about 
200 0 colonies in one Petri dish of 10 cm diameter Thus 
about 20 million colonies can be screened on a sheet of 
gal agar of 1 square meter. 

It i£ to be noted that bacterial colonies which appear blue 
on X ,al Petri dishes .l,ht be false positives due to a 
station in the bacterial genome which confers on it th 
capacity to metabolize lactose, or for other reasons than 
o Mich result fro. a catalytic activity of the nove! 
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^ k fhP cells of the colony. Such false 
protein -P»sse, ceU ^ ^ 

p : ii : e i:::„:ci *. «. — — *• 

U " " P EBG -E coli host cells. 1£ the ?-<!<" 

»»«. t or.in 9 ' ^ novel protein coded by the new ,e„e 
activity is due to the P tran sformed by 

i„ the expression vector, all those 

Lt vector .ill exhibit f>- 5 al action m _ 

th e initia! biue colony is doe to a of the 

^^ ^ is a rare event and independent 
of the hos t cell, it xs r ^ 

transformation, thus tne number o. functlo n 
of transformed E coli capable of expressing gal 
will be small or zero. 

^ .„wer of mass simultaneous purification of all the 

i:r s i: ^rrrrL^ carry ^ > ~ : 

stressed, ^ap. cata lvtic function, and tnat 

The 5 ; Ilbflity -t a new pePtrde or polyp^de carr es 

- r — i ea =t weakly is 10 . - n -^ 

o»t this tunction .ea bacteti al host 

probability that a clone o£ the co 

T. "trU or^ bacteria which are screened. 
^ l" wUl be attributable to the hovel ^ 

* — r..^T»- :r- rc-rtation. 

these expression vectors P bact erxa 

.„„„.., - ..... r r ,»;,• — ., :;... 
;::„:::,.:: r.x„.. ....... ..- — •• 



■ Allowed by such retransf ortation , allows the 
col omes, no^ y vectors truly pos;t 

a " K "" !/c lytic .otivity. despite a hi-h background 
for a desired catalytic function. 
rate o£ stations in the host c e u= .or -he - 

Foll owing screening operations of *«• . £ ^ P °;"£ 

to purify the new protein by est b ^ ^ 

production - r. ct Pi : h " ia» t i«ti«. o. the usefui 
poss lb le by he . c th ^ 

protein occurs « e h ConMqu =ntly. either 

the gene coding -c. the he r-ovel „ene 

^ ^""^To' TIo I appropriated egression 

U is -asibie to appiy this .ethod - ™^ ^ 
en^tic function for which an ap o ^ 

assay exists- For s.cn -cr to the 

... faction which is sought be 
the enzymatic fj " Ctl ° QUt scree nings not oniy 

~„->i t+- ic possible to t-dij-j , 
host ceil. F other aesned 

£or an en^atic function but for ^ ^ ^ 

property for which it i P ^ carry 

appropriate biological css.y. * visua Hzed 
out , even in the si.ple case of (3 gal f 
on an x-gal Petri plate, , screening of on 
m Ulion, or even a billion novel genes 
activity or any other desired property. 

SeJ ££ UHLJ>fJSU 1 ^^ 

„ „ it is possible to use selection 
On the other hand, it P other wise, where 

technics .r ^y Property cata y^ rty ^ ^ 

Z^TZ of z r : - ~- ~ £ 

can be used to select non -limiting , but 

expressing the desired novel »' ' ™ fu „ ction 

concrete example, selection for P gala 
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s ,a,l he described, appropriate clone ^ ^ 

i, not able to grow on described above, it 

T „us. after carrying out ^ ' of host cellE 

j, possible to culture a very large ^ ^ ^ 

transformed by the expression vecto progr essive 
9e „ es , under selective -dit-s ■ "tb ^ JT^^ q£ 
dilution of otber sources o ^ ^ q£ such 

Xactose alone from the start recoInbi „ation , or by 

selection, in^i™ mutagens by^^ ^ 
explicit, recovering ^ ^ lit[ „ by various 

m „t,geni 2 ing their f^^VTe^igue. permits 

mutagens, or by any fulfil the desired 

adaptive i-provements in the capa y tecnnlq „ e s and 

catalytic function. «hen bo h s^c ^ as 

convenient bioassay technics ^ ^ _ 
i„ the present case, it V represe „tation of host 
technics initially^ enr the^P^ ^ ^ ^ . 
bacteria expressing the p 9 ^ establisU 

screening on a Petri plate . ^ ^ ^ absenc(s 

efficiently which are the positive ^ ^ pro9essi »,l, 
c.f convenient bioassays, »PP ^ y ^ QI , 

stricter selection is the easie ^ e!tpre ssion 

small nu*J>er of distinct host « desirca 
vectors code for the proteins catalysing 



reaction. 



It t . possible to utilUe «"» J^" ~£ 

proteins having a large ^ ^ catalyse 

functional characteristics beyo ^ ^ carry 

a specific --ion - e ^le^ ^ ^ 

out a screen or select ^ HocJ( the 

cis-regulatory sites on the functions, or block 

expression of on, , o« ^ tran scriptio„. 
transcription of the DNA, or * 

f f coli, a clone mutant in the 
For example, in the case of E. Co ■ expresseS G» -gal 

repressor of the lactose p lactose 
function constitutively due to the 
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operator is not repressed. All cells of this type produce 
blue clones on Petri plates containing X-gal medium. It is 
possible to transform such host cells with expression 
vectors synthesizing novel proteins and carry out a scree, 
on X-gal Petri plates in order to detect those clones whicn 
are not blue. A^ong those, some represent the case where 
the new protein binds to the lactose operator and represses 
the synthesis off -gal. It is then feasible to mass isolate 
such plasmids, retransform, isolate those clones which do 
not produced -gal, and thereafter carry out a detai-e- 
ver i f ication . 

As mentioned above, the process can be utilized in order to 

create then isolate, not only exploitable proteins, but 

also RNA and DKA as products in themselves, having 

exploitable properties. This results from the fact that, or, 

one hand, the procedure consists in creating stochastic 

, ™ 1T - ir-v r--v ^4-pT-art directlv with other 
seauences of DNA wnich may "-e.o- 

ceUuiar or biochemical constituents, and on the other 
hand, these sequences cloned in expression vectors are 
transcribed into RNA which are themselves capable o. 
multiple biochemical interactions. 

m sample of ^ «f the procedure to c r g^te^nd^elect 

for a DNA which i s useful in itself. 

This example illustrates selection for a useful DNA, and 
the purification and study of the mechanism of action of 
regulatory proteins which bind to the DNA. Consider a 
preparation of the oestradiol receptor, a protein obtamea 
by standard techniques. In the presence of oestradiol, a 
steroid sexual hormone, the receptor changes conformation 
and binds tightly to certain specific sequences m the 
genomic DNA, thus affecting the transcription of gene, 
implicated in sexual differentiation and the control of 
fertility. By incubating a mixture containing oestradiol, 
its receptor, and a large number of different stochastic 
DNA sequences inserted in their vectors, followed by 
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nitration of the mixture across a nitro el 
membrane, one has a direct selection tor those stocha c 
D[J A sequences binding to the oestrogen-receptor complex 

^ , n . P f P ;n are retained ny 

where only those DNAs bo.no to « p.o.c... -e 

the membrane. After washing and elution. the DNA liberated 
; rom th e membrane is utilised as such to transfo 
bacteria. After culture of the transformed bacteria, the 
vectors which they contarn are again purified and one or 
.ev-r.l cvrles of incubation, filtration and transformation 
^"carried out as described above. These procedures allow 
th . isolation of stochastic sequences of DNA having » 
elevated affinity for the oestradiol-receptor complex, -uch 
fences are open to numerous diagnostic and pharmacologic 
applications, in particula, , for developing synthetic 

. r * fp^iiitv and treatment or 

oestrogens for the controx 01 ^.a..t) 

sterility. 
CreationanjL^^ 

T^^^TTT^r^Tor stochastic DNA sequences, 
produced as has been descrxbed and cloned in an expression 
vector. It follows that the RNA transcribed from these 
sequences in the transformed host cells can be use.ul 
products themselves. As a non limiting example, it 
possible to select a stochastic gene codxng for 
suppressor transfer RNA (tRNA) by the following procedure 
A la rge number (> 10 8 ) of stochastic sequences are 
transformed into competent bacterial hosts carrying a 
•■nonsense" mutation in the arg E. gene. These transformed 
bacteria are plated on minimal medium without arginxne and 
with the selective antibiotic for that plasmid (ampxcxllxn 
if the vector is P UC8) . Only those transformed bacterxa 
which have become capable of synthesizing arginxne will be 
able to grow. This phenotype can result either from a back 
mutation of the host genome, or the presence in the cell of 
a suppressor. It is easy to test each transformed colony to 
determine if the arg + phenotype is or is not due to the 
presence of the stochastic gene in its vector; xt suffice 
to purify the plasmxd from thxs colony and verxfy tha. 



■ - i -a E cells tran-forr.cd by 
confers an Arg+ phenotype on ,xl a*g 
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reactions 

Below we describe another «... of selection, open to 

ependent applications, based on the ■ 
s ™ taneous ana parallel ..lection of a certa; , . - 

vo*. of ratalvsinq a connected sequence 
novel proteins capable of catai,— s> 

of reactions. 

Th e basic iaea of this method i. the following : given .» 
LtUX ensemble of che.rcal ^Pounds censure 
aiding Moc*. or events of -str™ 
is h0 pea to synthetic o ne or .... ; ----- 

compounds by means of a catalyse 4 reaction 
. hrre ex-ts a very large number of reaction 
reactions, there ex.-.b d for 

routes which can be partially or completely subs-.-- ed o 
one another, which are al! — oayna^ically poss b^ 
wh ich lead fro. the set of building bloc*, to th ae 
target co-pouna <s) . Efficient synthesis of a targe 

ol ola i/favored if each step of at least one reaction 
pathway leading fro™ the building blocK compounds ^„ 
Lget compound is "^ed of reactrons each.-c h^ 
catalysed. On the other hand, ax 

^ orlant to determine which among the many independent 
partially independent reaction pathways « catalysed 
partially / we have shown how it is possible 

the previous description, we 

to obtain a very large number of host cells each 
expresses a distinct novel protein. 

Bach of these novel proteins is a candidate to catalyse one 
or another of the possible reactions, in the set o. 1 th 
possible reactions leading fro* the 
blocs to the target compound. If a sufficie, >tly 
n-er of stochastic proteins is present in a c 
mixture containing the building bloc, compounds, such 
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^sufficiently large n-«i of the possible reactions are 
catalysed, there is a high probability that one connected 
, e of reactions leading fro. the set of uU i 

seq , md will be catalysed by 

olocK compound to the targe. „o... r ~ 

a subset of the novel proteins. It is clear 

rocedure can be extended to the catalysis not only of one, 
but of several target compounds simultaneously. 

■ rin iP it is possible to proceed as 
Based on this principle it P 

^ ^Plect in parallel a set o. 
follows m order to select y L 

rltalvsinq a desired sequence of cnet.av.ai 
proteins cataiysmy 

reactions : 

, .^ (fv the desired set of compounds constituting the 
^ blocXs, utilising preferentially a reasonao y 
Urge number of distinct chemical species in o.«r 
Urease the number of potentia! concurrent reactions 
leading to the desired target compound. 



..using an appropriate volume of reaction medium : , 
very large n-e. of novel stochastic proteinase 
ro m transformed or transfected ceUs synthesizing 
, (f , s C8 „ y out an assay to determine if the target 
proteins. - thU £orn , atio r. 

compound is formed. If it is, conn 

compoun the mixture of novel proteins. If 

1 «» should contain a subset of protein, 

fatalysing one or several reaction pathways leading from 
t h tuildlng bloc* set to the target compound. Purify and 
vide the initial ensemMe of clones which synthesize th 
..t of novel stochastic proteins, the subset which is 
Quired to catalyse the seguence of reactions leading to 
the target compound. 

More precisely, as a non limiting example below we 
describe selection of novel proteins capable of cay- 
th e synthesis of a specific small peptide, in ^ 
pentapeptide, starting from a building b 
constituted of smaller peptides and amino acids. 
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* a k » a linear sequence of 20 
peptides .re -statute b * ^ ^ 

different types of amino acides or e ^ 
the carboxy terminus. Any peptide can be „ ptides 
6tep by the terminal condensation of two seller peptide 
J J two amino acids,, or by hydrops o a Ur g e 
peptide. A peptide with M residues can thus 
Li condensation reactions. The n-ar of reactl n . J 
which a set o £ peptides having pQ5sible 
can be interconverted is larger i ^ 

t This can be expressed as 
^ L^'in -ml given ensembie of peptides, a 
"^C Jl. independent or partially ^--n 
reaction pathways lead to the synthesis o £ a pe 
target peptide. Choose a pentapeptide whose pr » > 

•„„.t-i« hv some common as^ay teen. a i 
determined conveniently by some . nr , nW) 

K : n} , pressure chromatograpny ) , 
example HPLC (liquid phase high pressur 
or paper chromatography. Formation of a peptide 

* dilute aqueous medium, but if tne 

requires energy in a anute h -, r p 

■ in the condensation reactions are 

peptides participating m the co ^ 

adequately concentrated, formation of P eptxd 
thermodynamically favored over hyarolysxs ™ d 
efficiently in the presence of an appr = ^ 
catalyst, for example pepsin or trypsin, without q 
th. presence of ATP or other high energy compounds Such 
reaction mixture of small peptides whose amino acids ^ 
marked radioactively to act as tracers vith H, C. , 
constituting the building ^ set can 

sufficiently high concentrations to lead 

reactions. 

, t feasible to proceed as follows : 15mg °« 
For example, it is feasible top 

each amino acid and small peptides having 2 to 
acids, chosen to constitute the building bloc, set 
dissolved in a vo^e of 0.3 5 ml to 1.0m ° > 
phosphate buffer. A large number of novel P 
' generated and isolated as described above are punfi d 
their bacterial or other host cells. The mixture of th.. 
ovel proteins is dissolved to a final concentration on the 
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„ f n 8 to 1 0 mg/ml xn the same buffer. 0,25 .1 to 
order of 0,8 to l,u ly mi xture of 

0.5 .1 of the protein .ixtur. » * ^J,^ £or , to 
bull di„ g blocV.s. This is incubateo at 23 ^ ^ 

lnt " Va 'of the B 'xt«. o £ novel proteins. These aliquot. 
' Taly J b y c„rca t o 9 ra P hy usin g n-butanoX-acetrc 

are analysed ny solvent. 

f^.c. 20-24 by volume) as tne 
^d-pyridrne-wate. ^ <3. 6J0 y ^ ninhydrin ^ 

A ne L-u^uiuu ^ . ^fenq^vmq screen^) . 

u /.Mth or without mtensx.i^y 

autoradiography <*ith or Kl ,^ ina block set 

Because the c „a co s tl t . h^ ^ ^ 

are radioactively marked, activi ty high enough 

i+- will have specilic acuviu) ^ 
radioactive and it will P ^ ^ placQ o£ 

to permit detection at the e.e ^ 
,. r ,.^ ^rnmatoaraphic analysis, xt ^ pos 

, ;*' ssue liquid chromatography) which is taste: 
HPLC (high pressure nqu 

- • ior t-o carrv out. More generally, all 
and simpler to carry u rrreeauen tly it is 

luH . D . 0 nedures can be employed. Conse^e^ y 
analytic P i °^ uul .„„ roU nd nf less 

. Klp d^ect a yield of the target c.mpou 

possible to de.ect y ^ to the 

4-* a „ one nart per million uy 
compounds used as initial building blocks. 

. ., ... in the conditions described 

above, but not wnen a MDression vector 

Knt:t ceUs transformed by an expressi 

from host cens w fAr _, tion Q f the 

contain^ no J.n-.. «- ™ ontamina „ ts 

pentapeptide is ^ ^ o ° f , subset o£ t he hovel 

and thus requires the presence 
proteins in the reaction mixture. 

-TJT:,Z TJZL this sequence 

6X T t'here are about 5 novel proteins vhrch catalyse the 
1S 5, there are ^ Qf bacterla 

necessary reactions. If tne 

oontarnin, the expression vectors which code for 
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genes has a nu^er of distrnct no-e! genes 

h of V 000 '000 all these expression vectors 

l «< .transformed into 1.0 distrn c, sets 

8 «. * ratio of vectors to bacteria which is 

r in D K^fp^ia at a ratio ui 
I mcil lV - that, on average, the nu^er of b.cten. 
Tit which are transformed is about half the number 

^ • aenes ± e . about SOO^OOO. Thus, the probability 

of initial genes, i.e. ^ the 

that any given one of the 100 

»*- of 5 critical novel proteins is U/2) 
entire set of 5 cnt bacte na, about 3 will 

^ong the 100 initial se.s o ^ 

.-K^ rritical transfurnioiiLS. -i- 
contain the 5 critica 500 <000 rather 

^t<- the total number of new genes is onls 
— -00,000. By successrve ^-^^ 

isolates 5 ^ ^ ^ stochastic genes 

:;;r;::r-:;':rr;. ^^ T .: := :, s 

„ = „ w-e-e it is necessary to catalyse a -eg 
a case w,.e . i ^ „, need to be 

reactions anu ^ ^ en " - ,^ 115t the 

Mrflllel it suffices to adj — - 
isolated in parallel, c 

- m = + --r-« =;"- h that each set or ^.u 
^Itiplicrty Of *—^: tochastlc ge „es, ana to 

tact r.o a »rr. °\ * 1. The ^ut» r .u - 

use 200 sucn set^ wj. ^ ^20 ^ n m 

novel proteins are found in one set is ' " • « " 

Thus about 2 among the 200 sets will have the 20 novel 

target compound. The number of cycles q 
isolation of the 20 novel genes is on the order of 30. 

The principles and procedures described above generalise 

1 the case of peptides to numerous areas of che.rstry 
Tich chemical reactions ta*e place in aqueous 
temperature. P». and concen ^ ^ 
general enzymatic function. In each ccse it 

<-n dPtect the formation of tne 
make use of an assay method to detect 

,, e , Tr ic also necessary to chose a 
desired target compound (s). It is aiso 

Sufficiently large number of buUding bloc, expounds to 
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au g ment the »-er o £ reaction seances -hiO iead to th. 

target compound. 

Tne concrete example which was g iven for the a 
target pentapeptide can also be generalise as follows. 

, crHhed generates among other products, 
The procedure as described, genex 

stochastic peptides and proteins. These peptides or 
StOChaS P ^ivticallv or in other ways, on other 

proteins can act, catalytically or ^ trates on 

compounds. They can equally constitute the 

which they act. Thus, it is possible to select ( wr _ec ) 
: 0 the capacity of such stochastic pept.des or p.otexns o 
interact among themselves and thereby modify tne 
interat v , fm-tion o^ some among 

conformation, the structure or the function o. 

q ,- i ^-, ^ is possible to select (or screen) .01 
the capacity c£ these peptides and proteins W ^- J -- 
„; ' heaves, nydrdysis, condensation, 

^tp.ptid.ti«, or other r-etio. modifying the pepti es. 
elmpie, the hydroiysis of a g ive„ stochastic P ec id 
k the set of stochastic peptides 

■ =+ . i p? ct one member 01 tne sex. u~ 

by at xeast one ---. irp ^ k„ radioactive 

and proteins can be followed ana measure. ~, 

:X of the given protein , 

a ^ yture of the stochastic proteins in the presence o 

v, c mo Ca Zn Fe and ATP or GTP . The apperance o. 
ions such as Hg,Ca,in, te then 
radioactive fragments or the marked protein s then 
measured as described. The stochastic protein s, whi h 
: se this reaction can acain be isolated. alon g wi 
the /ene.s, producing the., by seguential diminution of the 
library of transformed clones, as described above. 

An extension of the procedure consists in the « 
an ensemble of stochastic peptides and polypeptides capable 
of catalysing a set of reactions leadin, from the initia 

ildin, blocks (amino acids and small peptides, to some 
the peptides or polypeptides of the set. It is th r ore 
also possible to select an ensemble capable of catalysing. 

°o„ P n synthesis, such a r.fle.ively autocataly tic set can 
be established in a chemostat where the procucts o. 



the 



.nn^tantly diluted, but where the 

reactions are constantly main tained 

nf the building blocxs ^ 

concentration ot tn is aicea 

constant. — «^ Sy "r; pe P U = - — = * 

■ rn ^v, e rnnplex set Ox pep^-t 
by enclosing ...e - -P nic aque ous environment 

standard techniques. In a y P react ion forming 

la r g er pept.es l«.r. * ^ ^ ^ 

liposomes, drives vater somes , henc e favors 

condensation reactions out of such an 

f l.roer polymers. Existent 

synthesis of larger P > ^ dimensional 

..tocolytic ense^le can synthesis of 

gel •-"^^^^.".n.'poiyp.ptiaes. The 

. stabie distribution of pep ^ ^ q£ 

appropriate reaction v o,u» a LO 

cpecies used, ana . ._; c 

fBV or the formation of peptide oo^ ; . ; - ; - 

The aistnbution of » h -" due t „ the eBerg ence of 

ensemble is free to va.jr - pppt ides and 

,..■..,4.-;^ en sembles. 

variant autocataiy ^ ^ , ta , ytic set may have 

Polyps which constitute an ense ^ le 
certain elements in common wiu ^ - _ by 
(constituted o£ coded pep^des and polypoid ^ 

cur proce dure, but ca al the ensesP ^e of 

uric the ^ 

establish such an autocatalytic set q£ 

- — ■ - "T^r- .T^ic set can 

transformed clones In addit^o ^ sto chastic 

contain coded peptides initially aut ocatalytic 

«t. TO rsolate this coded ^ 

the autocatytic set can recog „izihg a 

ionization in ah animal, polyclo autocatalytl c 
very large number of the constituents of 



set . 
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utilized to screen the library of 
ThP cn sera can be utnutu 

r haS Uc ,enes ,0 Una those 9 enes 

able to cosine with the antibodies present the 

Thi . set o £ stcchast.c g e„es expresses . l.r S . - 
ooaea stochastic P»t.l». wfc«h P« »" > „ 

of such an a^o^-a.^ f which 

.4= the library of stochastic genes, from 
diminution of the lioraxv ^ ^ be ^ n 

the subset detected by immunological netnocs h** . - 

subtracted. 

♦ lv tic sets of peptides and proteins, obtained 
Such autocatalytic sets p p 

~ «„n-K e r of practical appiKa^c..-. 
as noted, may tine a u^-c. t- 



CLAIMS 



! Process for the production of peptides or polypeptides 
^ microbiological means, characterized in that gene, w ic 
are at least partially composed of stochastic synthetic 
polynucleotides are produced simultaneously in a cordon 
roi Ueu, that the g enes thus obtained are introduced into 
host cells, that the independent clones of the transfer, a 
nost cells containing these genes are "-"aneous v 
cul tivated so as to clone the stochastic genes and lea 
the production of proteins expressed by each of these 
mastic genes, that screening and./or selection is 
carried out on such clones of transformed host cells to 
identify those clones producing peptides or polypeptides 
having at least one specified property, that the clones so 
xdentified are isolated, then grown in a manner so as to 

. , a . q . _ e ce nHdp or polypeptide having the said 
produce ai xeast w^e pe r . .r. 

property . 

2 Process according to claim 1, characterised by the fact 
that the genes are produced by stochastic copolymer ization 
of the four types of deoxyphosphonucleotides A, C — 
T starting from the two extremities of an expression 
v^tor which was previously linearized, then by formation 
of cohesive extremities to create a first strand of 
stochastic DNA constituted of a molecule of expression 
vector possessing two stochastic sequences whose 3 
extremities are complementary, followed by synthesis of the 
second strand of the stochastic DNA . 

3 Process according to claim 1, characterized by the fact 
that the genes are produced by stochastic copolymer izati on 
of double stranded oligonucleotides which do not have 
cohesive ends, in a manner so as to form fragments of 
stochastic DNA , followed by ligation of these fragments m 
an expression vector which was previously linearized. 
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o 1 characterized by the 

4 Process according to claim 2 or 3, cha 

fact that the expression vector is a plasnud. 

«. 4 characterized by the fact 

5. process according to i, — 
t h 3t the expression vector is P UC8. 

n fo claim 2 or claim 3, characterized 

6. Process accoramg to claim fraarent of viral 
by the fact that expression vector is a fragment 



DNA . 



7 . Process according to claim 2 or clai* 3, ^aracterized 
by tne fact that the expression vector is - ^ 
plasmid and viral DNA. 

^ i *-o 6, characterized by the 

p process duurcu^ 

fact that the host cells are prokaryotic cells. 

^ 1 ^ 7 characterized by the 

9. process accoramg to ^ai,..s * — 

fact that the host cells are eu^vO.. 

EProcess according to claim B, ch "* C ™ * ^ ^ 
that the cells are chosen among HB101 and C600. 

H^ocess according to claim 3, charact erized by the^fact 
that the oligonucleotides form a group P 
octamers. 

^™ to claim 11 characterized by the fact 
12. Process according to claim following 
that the group of palindromic octamers is 



group 



5' GGAATTCC 3* 
5' GGTCGACC 3* 
5' CAAGCTTG 3' 
5' CCATATGG 3' 
5' CATCGATG 3' 
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^ ri.im 3 characterized by the fact 
13 Process according to claim J, ci.di 

iJ, " ute , . „ m , nrouD of palindromic 

that the oligonucleotides form a group P 

heptamers . 

14 . Pr „ce S s accords to cUi- 13, ch«.ct.r».d by «* 

fact that the group of palindromic heptamers is 



following group 



5' XTCGCGA 3' 
5 ' XCTGCAG 3 ' 
5 ' RGGTACC 3 ' 



where X = A , G , C or T , and R = A or T. 

, 5 P.ocess according to claim 4 and one of the claim 12 or 

. .4.,^ ~-- f-ivc-t- isolates and 
14, characterized by the tact tnat — 

pu ifxes the transforming DNA derived fro, a culture of 
P I rt clnnee of transformed host cells obtained by 

in oe P end.nt c e in clai » 11 or in clain, 

proceeding m the ma^.e- -t- 

F 4.,- t(lP nrrified DNA by means ot at 

n then that one cuts the punnet 

13 ' the vir _ vV .v, ...P.ponds to a specific 

least one restriction enzyme vhx^ — 
es riction site present in these palindromic octamers o 

h„t absent from the expression vector being 
heptamers, but absent n , n. 

utilized, that one thereafter simultaneously tie— ~- 
Inse^le'of linearized stochastic « 
by T4 DNA ligase in such a manner as to create 
ensemble of m» contain, new stochastic seauence a nd 
th at one uses this new ensemble of transform *9 -A « 
transform host ceils and clone such genes, and 
one carries out screening and/or selects and isolat the 
new clones of transformed cells and that one grows these so 
as to produce at least one peptide or polypeptide having a 
desired property. 

EProcess according to claim ,. * "'J^ 

that the said property is the capacity to catalyse a gxven 

chemical reaction. 
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1 "or the production of 
17. Process according to claim * cter ize d by the 

several peptide, and/or polypept - ^ ^ ^ 
£act that th . ..id P^J-^ ^ ^ inuial 

sequence of reactions i , ~ 

chemical compounds to at least 

• 16 for the production of an 
"■ M " SS ""I!! ". l than one peptide -'or 
ensemble consisting or re£lexlve ly autocataly tic , 

poWPtid. which J ^ property 1S the 

^^^terized by the tact ,- n i P itseli 

to catalyse the synthesis of the e^e..— 
capacity to 1 oligop eptides . 

starting from ammo acids an 

¥ „ rlaiP1 1, characterized by the fact 
19. Process accoramg t. -aim - ^ ^ 

the said property is „_ ornes of a 

Actively the chemical and/or bio^rc.i - 

given compound. 

k„ fact 
■ _ t q characterize^ l..- 

20 . Pro ces S accords to ciar, . ». ^ c|p . ctty t o 
that the said property ^ po i yp eptide. 
selectively the catalytic activity 

, ■„ 19 characterized by the fact 
EProcess according » ^ or 

r l£ ; h : t t^n^ai — - - ^ one 

biologically active compound. 

. 21 characterized by the fact 
22 . Process according to c j ^ cho5en among 

that the said biologically actx P ^ growth 

the hormones, neurotransmitters repllca tion and/or 

or :r: r regulators ot re^xx 

n ■ i characterized by the fact 
23 . Process according to claim , ^ a given 

that the said property is the capacity 

1 igand . 



24 utilization of a peptide or po]ypepcide obtained b v the 
process according to claim 23 for the detection and/or 
titration of the ligand. 

25 Process according to claim 1, characterized by the fact 
that the property is to have at least one epitope similar 
to one of the epitopes of a given antigen. 

26 Process according to claim 19 and 25, characterized by 
the fact that the said property is the capacity to simulate 
or modify the effects of a biologically active molecule, 
and that screening and/or selection of the clones of 
transformed host cells producing at least one peptide or 
polypeptide having this property is carried out by 
preparing antibodies against that molecule, ar.d utilizing 
'these antibodies so obtained to identify those clones 
containing those peptides or polypeptides, then by growing 
.he clones thus identified and separating and purifying the 
^ptide or polypeptide produced by these clones, ana 
finally by submitting these peptide(s) or polypeptide ( s) to 
an assay in vitro to verify that it has in fact the 
capacity to simulate or modify the effects of the said 
molecule. 

27 Peptides or polypeptides obtained by the process 
according to claim 1 or claim 26, utilizable as active 
substances having a pharmacologic and/or chemotherapeutic 
action. 

28. Peptides or polypeptides, obtained by the process 
according to claim 25, utilizable to diminish, in_vitro or 
in vivo , the concentration of free antibodies, specific for 
TnTlTid antigen, by formation of bonds between these 
paptides or polypeptides and these antibodies. 

29. Peptides or polypeptides, according to claims 27 or 28, 
utilizable as agents to suppress immunological 

hypersensitivity . 
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30. Peptrde S or polypeptides, obtained by the process 
according to cl.i-25. -tlli"". agents to create 
tolerance with respect to the said antigen. 

31. Process according to clair. 25. charactered by the fact 
that the antigen is EGF. 

32 Peptides or polypeptides obtained by the process 
Icclaing to clair. 31. utilise lor the che.otherapectic 
treatment of epitheliomas. 

33. Process .cording to claim 1. eharactenzed by the fact 
tha t clones of transform host ceils P rod U cin, , p.P or 
polypeptides having the desired property are 
Lolled by affinity chromatography on -tib = e ; 
corresponding to a protein expressed by the — 
fragment of the DNA hybrid. 

-j ^ „v 2 r*^^^p^^zed bv the fact 
34 Process according to claim 3,, chara.^^e. . 

- fran ,Prt of the DNA hybrid contains a gene 
that the natural fragment or ine . 

expressing 3 -galactosidase , and that one identifies an- 
Ites \l said ciones of transformed cell -t s V 
affinity chromatography with anti- f> -galactosidase 
antibodies . 

« Process according to claim 1. or clai. 34. =h.r.ct.ri»d 
Z the fact that after expression and purification of 
hybrid peptides or polypeptides, the novel fragments are 

separated and isolated. 

36. Application of the process according to claim 25 or 

~* =, vsrrine characterized 
claim 26, for the preparation of a vaccine, 
by the fact that antibodies against a pathogenic agent are 
obtained and used to identify those clones producing at 
least one protein having at least one epitope 
one of the epitopes of the pathogenic agent, that 
corresponding clones of transformed host cells are grown in 
such a manner as to produce this protein, that the protein 
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is isolated and 

« ! ^a„ dthati PUrifie<i ..he cul turer 

vacci„„ , . s F'ottin is US ed f cloneE o 

ine ■»•»« «. p athogenic ;;; d j° r «« p^ uction of . 

least on. ^""^ ^"ct.,!,., ^P-ation 

th « these " g ""bodies a„ ^ ° f a " 

an ib a ; tib0dieS "cove/ea 9 P ™^< 

antibodies ^ ered and vurin * 

P >°^9 at Jeast " e USeS to identi ^at 

5imi lar to „ ' ° ne P r °tein havino ot cl °"" 

«°»« 0 t °; e ° f ^ "P'top.. of the Z ^ '"^ 

a/e '""S^ >»„ eeas °" t °» 

° f an anti-„ BV vaccine. " Sed for the 

38 -Process 

the host TJn" 9 l ° ^ '< ^aracte^ec D 

osi dase r n -ther "tr^ 

COli - and that t„ " 0,5 EBG 9e„e th . 

an x „ , hese transformed h ' 15 z p 

'"Sal medium a i,„ ost cel ls are 

industrial * Cl °"° °* host JT, tMs DKA 

« Protein Pr ° dUCti - of at Jeast' CW1 " a PP« P ri at e for 

■*•» Ulth ^-..uctosidasrL::,:;^^. poi ^«^ 

39. Process ^ 

^at the CCOrdin 9 to cl aim 2 ch 

th * said property i. " ' Charact eri 2ed b 
impound. ty is capacity to bind * the fact 

lo a given 

"•Process accord 

that th CCOrdl "9 to claim 39 „, 

Prides ' • h -acte r i, ed by the 

"•Polypeptides and prote . n ^ ch °-n MO n g th . 
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41 .rrccc.s « .»,«. 4.. characters by the fact 

that the said proteins are proteins regulating the 
transcription activity or replication of MA. 

EProcess according to el.i. 39, charactered by the i.et 
that the sard compound i. chosen at.ong the sequences of " 

and RNA - 

43. Proteins obtained by the process according to claim 4C 

or clairr. 42. 

44 Process for the production of DNA. characterized by the 
fact that, in the sane milieu, genes which are at least 
Partially composed of stochastic synthetic polynucleotides 
L prod'ueed, that th, genes so produced are 
into host cells an a manner to produce an enseal* 
transformed host cells, that these are grown so as to 
produce independent clones of the host cells so produc , 

that screening and/or selection is carried out on this 
LmMe to identify those host cells which contain those 

stochastic sequences of DNA having at least one ^d i eo 

property, and that such DNA is isolated fro. the identitied 

cultures of the host cells. 

45. Process according to claim 44, characterized by the fact 
that the said property is the capacity to bind to given 

compound. 

46. Process according to claim 45. characterized by the fact 
that the said compound is chosen among the peptides, 
polypeptides and proteins. 

47 Process according to claim 45, characterized by the fact 
that the said compound is a compound regulating the 
transcription activity or the replication of DNA. 



4. 

„ to claim 40, characterized by the ract 
inscription activity or repUcatron of D»». 

, 2 . Proc ess accordin, to ». ^"^^r:^ 

that the said compound is chosen a»on, the soqoenc 



and RNA. 



• w the process according to claim 40 
4 3. Proteins obtained by the proce- 



or c j. a lin 
44 



. for the production of DNA , characterized by the 
44. Process for the pro t 

, . in th e same milieu, genes which are 
fact that, xn s svnthe tic polynucleotides 

Partially composec ot s.o.hast - introduced 
._ .v^rp^ ; that the gene- -~ ^ 

hi"."* US an a manner to produce an ensen^ie 
mto host cells bo as t0 

transformed host ceils, that these 
produce independent clones of the host cell P 

,q/r^ selection is carriea uu- o~ 
that screening and/or select! 

p „. £ „ki. to identify those host cel.s v.. , 
stochastic sequencB of ON, havxng at on ^ 

property, and that such DNA xs xsolated from 
cultures of the host cells. 

,. M to claim 44, characterized by the fact 

45 . Process according to claim , ^ 
that the said property xs the capacxt> 

compound . 

46 . „ according to cU, 45. ^<*"^»£ t ™ 

~a i c r-hosen among tne P e P Llu 
that the said compound is chosen 

polypeptides and proteins. 



48 Process according to cLaim ,narac*er.< -ed by the fact 
that the said compound is a regulatory protein controlling 
the transcription or replication of DNA. 

^ a cpcuence of DNA obtained by the process 

49 Utilization Oi a sequence oi 

accordin, to = 1.1- 46. or el.i. 47. »s a cl.-r.9ul.t0ry 

sequence of replication or transcription of a neighboring 

sequence of DNA. 

4 r~ r bi r characterized by the tact 

50 Process according to clc^m _^iauu 

that the proteins obtained have the capacity to modify the 
transcription activity, the replication, or the stability 
of DNA . 

• • * • r, of a n-ctein obtained by the process 
SI. Utilization oi a F* ultl 

T _ j .: * . , ^ franker lotion, 
according to claim 4b, to >— > — — - * 

replication or stability of a sequence of DNA in a ce.l 
containing this sequence of DNA and expressing this 
protein . 

oijf characterized by the 

52 Process for the production oo. ^a, c..a_ 

fact that, in the same milieu, genes which are at least 
partially composed of synthetic stochastic polynucleotides 
are produced simultaneously, that the genes thus obtamc. 
are introduced in host cells in a manner to produce an 
ensemble of transformed host cells, that the independent 
clones of transformed host cells so produced are grown 
simultaneously, that a screening and/or selection is 
carried out on this ensemble in a manner to identify those 
host cells containing stochastic sequences of RNA having at 
least one desired property, and that the RNA is isolated 
from the cultures of host cells so identified. 

53. Process according to claim 52, characterized by the fact 
that the said property is the capacity to bind to a given 
compound. 
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54. Process according to claim 52, characterized by the fact 
that the said property is the capacity to catalyse a given 

chemical reaction. 

55. Process according to claim 52, characterized by the fact 
that the said property is to be a transfer RNA. 



